Abstract

We establish asymptotic normality for estimators of the additive regression
components under random censorship. To build our estimators, we couple the
marginal integration method (Newey (1994)) with an initial Inverse Probability
of Censoring Weighted estimator of the multivariate censored regression
function introduced by Carbonez et al. (1995) and Kohler et al. (2002).
Asymptotic confidence bands are derived from our result.

1 Introduction



Censored data arise in many areas of statistical application, especially in
epidemiology and reliability. When studying the relationship between a
dependent variable and covariates, nonparametric estimates are of particular
interest in the presence of censored data, because no scatter plot can be drawn
to detect the possibly complex form of this relationship. Several estimators of
the regression function have already been proposed. The key idea is to
transform the observed data and derive so-called synthetic data estimators (see
Buckley and James (1979)). For instance, Fan and Gijbels (1994) developed a
local version of the parametric Buckley and James estimator, while Carbonez et
al. (1995) and Kohler et al. (2002) studied a nonparametric Inverse Probability
of Censoring Weighted [I.P.C.W.] estimator (see also Brunel and Comte (2006)).

However, in most situations, the variables encountered (typically the onset of
a disease in epidemiology) are related to numerous factors or predictors.
Consider the particular case of the construction of a disease risk prediction
tool, that is, the estimation of the probability of developing the disease,
given a number (generally about ten) of predictors. In this setting, classical
nonparametric estimators, such as kernel-type estimators, are unsuitable because
of the well-known curse of dimensionality: for instance, the rate of convergence
of nonparametric estimators of the conditional survival function deteriorates as
the regressor dimension increases in a censored setting (see Dabrowska (1995) or
Deheuvels and Derzko (2007)), just as it does in an uncensored one (see, e.g.,
Härdle (1990) and the relevant references therein). One common solution to get
round this issue is to work under the additive model assumption, when possible.
In the uncensored case, several methods have been proposed to estimate the
additive regression function. We shall mention, among others, the methods based
on B-splines (see Stone (1985)), on the backfitting algorithm (see Hastie and
Tibshirani (1990)) and on marginal integration (see, e.g., Newey (1994),
Tjøstheim and Auestad (1994) or Linton and Nielsen (1995)).



In Fan and Gijbels (1994), it is shown that the backfitting ideas also apply
to censored data. Here, following the ideas introduced in Debbarh and Viallon
(2007a), we make use of the marginal integration method, coupled with initial
multivariate nonparametric I.P.C.W. estimators, to provide an estimator of
the additive censored regression function. At this point, it is noteworthy that
the developments we propose here for I.P.C.W.-type estimators apply, with
minor modifications, to other synthetic data estimators.



In former works, we established the mean-square convergence rate (Debbarh and
Viallon (2007a)), the uniform consistency rate (Debbarh and Viallon (2007b)) and
a uniform law of the logarithm (Debbarh and Viallon (2007c)) for such estimators
of the additive regression function in the presence of censored data. In
Debbarh and Viallon (2007d), we also proposed a method to construct simultaneous
almost certainty bands, that is, confidence bands which contain the true value
of the additive component with asymptotic probability one, uniformly over the
predictor domain. Obviously, such confidence bands may be very conservative, and
classical confidence intervals derived from an asymptotic normal law may be
desirable. Constructing such intervals is one of the aims of the present paper,
which is organized as follows. After recalling how to construct estimators of
the additive components in Section 2, we establish their asymptotic normality
in Section 3. This limit law completes the one obtained by Camlong-Viot et al.
(2000) in the uncensored case (see also Linton and Nielsen (1995) and Sperlich
et al. (2002)). Then, in Section 4, we show how to obtain confidence intervals
from the aforementioned convergence in law. Finally, Section 5 is devoted to
the proof of our result.



2 Notations 



Consider the triple $(Y, C, X)$ defined in $\mathbb{R}^+ \times \mathbb{R}^+ \times \mathbb{R}^d$, $d \ge 1$, where $Y$ is the
variable of interest, $C$ the censoring variable and $X = (X_1, \dots, X_d)$ a vector of
concomitant variables. Throughout, we work with a sample $\{(Y_i, C_i, X_i)_{1 \le i \le n}\}$
of independent and identically distributed replicae of $(Y, C, X)$. Actually, in
the right censorship model, $Y_i$ and $C_i$ are not observed and only $Z_i = \min\{Y_i, C_i\}$
and $\delta_i = \mathbb{1}_{\{Y_i \le C_i\}}$, $1 \le i \le n$, are at our disposal, $\mathbb{1}_E$ standing for the indicator
function of $E$. Accordingly, the observed sample is $\mathcal{V}_n = \{(Z_i, \delta_i, X_i), i = 1, \dots, n\}$,
and for all $t \in \mathbb{R}^+$, we set $F(t) = \mathbb{P}(Y > t)$, $G(t) = \mathbb{P}(C > t)$ and
$H(t) = \mathbb{P}(Z > t)$, the right-continuous survival functions pertaining to $Y$, $C$
and $Z$ respectively.

Further denote by $\psi$ a given real measurable function. In this paper, we are
concerned with the regression function of $\psi(Y)$ evaluated at $X = x$, in the
particular case where this function is additive,

$$ m_\psi(x) = \mathbb{E}(\psi(Y) \mid X = x), \quad \forall\, x = (x_1, \dots, x_d) \in \mathbb{R}^d, \tag{2.1} $$

$$ \phantom{m_\psi(x)} = \mu + \sum_{\ell=1}^{d} m_\ell(x_\ell). \tag{2.2} $$

In view of (2.2), the so-called additive components $m_\ell$, $\ell = 1, \dots, d$, as well
as the constant $\mu$ are defined up to an additive constant. Therefore, it is
quite common to work under the identifiability assumption $\mathbb{E}\, m_\ell(X_\ell) = 0$,
$\ell = 1, \dots, d$, which ensures that $\mu = \mathbb{E}\psi(Y)$.



Let $(h_n)_{n \ge 1}$ and $(h_{\ell,n})_{n \ge 1}$, $\ell = 1, \dots, d$, be $d+1$ sequences of positive constants
and denote by $f$ the density function of the covariate $X$. Introduce $f_n$, the
Akaike-Parzen-Rosenblatt (Akaike (1954), Parzen (1962), Rosenblatt (1956))
estimator of $f$ pertaining to $K$ and $h_n$,

$$ f_n(x) = \frac{1}{n h_n^d} \sum_{i=1}^{n} K\!\left(\frac{x - X_i}{h_n}\right), $$

where $K$ is a given convolution kernel in $\mathbb{R}^d$. We further denote by $G_n^*$ the
Kaplan-Meier estimator of $G$ (Kaplan and Meier (1958)). Namely, adopting
the conventions $\prod_{\emptyset} = 1$ and $0^0 = 1$, we have, for all $y \in \mathbb{R}$,

$$ G_n^*(y) = \prod_{1 \le i \le n} \left( \frac{N_n(Z_i) - 1}{N_n(Z_i)} \right)^{\beta_i}, \tag{2.3} $$

with $\beta_i = \mathbb{1}_{\{Z_i \le y\}} (1 - \delta_i)$ and $N_n(y) = \sum_{i=1}^{n} \mathbb{1}_{\{Z_i \ge y\}}$.
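The product formula (2.3) can be sketched in a few lines of Python. This is only an illustration, assuming the product form of the Kaplan-Meier estimator of the censoring survival function with no ties among the $Z_i$; the function name `km_censoring_survival` is hypothetical and not part of the original paper.

```python
import numpy as np

def km_censoring_survival(z, delta, y):
    """Kaplan-Meier estimate of G(y) = P(C > y) from the pairs (Z_i, delta_i).

    Product form: prod_i ((N(Z_i) - 1) / N(Z_i)) ** beta_i, with
    beta_i = 1{Z_i <= y} * (1 - delta_i) and N(t) = #{j : Z_j >= t}.
    """
    z = np.asarray(z, dtype=float)
    delta = np.asarray(delta, dtype=int)
    # N(Z_i): number of observations still at risk at time Z_i.
    at_risk = (z[None, :] >= z[:, None]).sum(axis=1)
    # beta_i = 1 only for censored observations (delta_i = 0) with Z_i <= y.
    beta = (z <= y) & (delta == 0)
    factors = (at_risk - 1) / at_risk
    # Factors with beta_i = 0 contribute 1 (this also covers 0**0 = 1).
    return float(np.prod(np.where(beta, factors, 1.0)))
```

Note that the roles of censored and uncensored observations are swapped compared with the usual Kaplan-Meier estimator of $F$: here the "events" are the censored points, since the target is the law of $C$.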



To estimate the regression function defined in (2.1) (at this point, we do not
work under the additive assumption (2.2) yet), we propose the following
estimator (see Carbonez et al. (1995), Debbarh and Viallon (2007a), Jones et al.
(1994), Kohler et al. (2002) and Maillot and Viallon (2007)). Denoting by
$K_1, \dots, K_d$ $d$ given kernels defined in $\mathbb{R}$, we introduce

$$ m^*_{\psi,n}(x) = \sum_{i=1}^{n} W_{n,i}(x) \frac{\delta_i \psi(Z_i)}{G_n^*(Z_i)}, \quad \text{where} \quad W_{n,i}(x) = \frac{\prod_{\ell=1}^{d} \frac{1}{h_{\ell,n}} K_\ell\!\left(\frac{x_\ell - X_{i,\ell}}{h_{\ell,n}}\right)}{n f_n(X_i)}. \tag{2.4} $$

Adopting the convention $0/0 = 0$, $m^*_{\psi,n}$ is properly defined since $G_n^*(Z_i) = 0$ if
and only if $Z_i = Z_{(n)}$ and $\delta_{(n)} = 0$, where $Z_{(k)}$ is the $k$-th ordered statistic
associated to the sample $(Z_1, \dots, Z_n)$ for $k = 1, \dots, n$ and $\delta_{(k)}$ is the $\delta_j$
corresponding to $Z_{(k)} = Z_j$.
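As a rough illustration of an I.P.C.W. regression estimator of the type (2.4), the following Python sketch assumes that the weights take the internally normalised product-kernel form $W_{n,i}(x) = \prod_\ell h_{\ell,n}^{-1} K_\ell((x_\ell - X_{i,\ell})/h_{\ell,n}) / (n f_n(X_i))$. The function names, the choice of Epanechnikov kernels and the decision to pass the values of $G$ and $f$ as precomputed arrays are assumptions made for the example only.

```python
import numpy as np

def epanechnikov(u):
    """Epanechnikov kernel, a compactly supported second-order kernel."""
    return np.where(np.abs(u) <= 1.0, 0.75 * (1.0 - u**2), 0.0)

def ipcw_regression(x, X, Z, delta, psi, g_at_z, f_at_x, h):
    """I.P.C.W. estimate of E[psi(Y) | X = x] with internal density weights.

    x: evaluation point of shape (d,); X: covariates of shape (n, d);
    g_at_z: values of G (or of its Kaplan-Meier estimate) at each Z_i;
    f_at_x: values of the density f (or of a kernel estimate) at each X_i;
    h: bandwidths of shape (d,).
    """
    X = np.asarray(X, dtype=float)
    n = X.shape[0]
    h = np.asarray(h, dtype=float)
    # Product kernel weights W_{n,i}(x), assumed form (see lead-in).
    kern = np.prod(epanechnikov((np.asarray(x) - X) / h) / h, axis=1)
    weights = kern / (n * np.asarray(f_at_x, dtype=float))
    # Synthetic responses delta_i * psi(Z_i) / G(Z_i), with 0/0 read as 0.
    num = np.asarray(delta, dtype=float) * psi(np.asarray(Z, dtype=float))
    g = np.asarray(g_at_z, dtype=float)
    synth = np.divide(num, g, out=np.zeros_like(num), where=g > 0)
    return float(np.sum(weights * synth))
```

Passing the true $G$ and $f$ corresponds to the "both known" case treated first in the proof, while passing a Kaplan-Meier estimate and a kernel density estimate gives the fully data-driven version.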



For all $x = (x_1, \dots, x_d) \in \mathbb{R}^d$ and every $\ell = 1, \dots, d$, further set
$x_{-\ell} = (x_1, \dots, x_{\ell-1}, x_{\ell+1}, \dots, x_d)$. Under the assumption (2.2), we will estimate the
additive components via the marginal integration method (Linton and Nielsen
(1995), Newey (1994)). Towards this aim, we introduce $d$ given density functions
defined in $\mathbb{R}$, $q_1, \dots, q_d$. Then, setting $q(x) = \prod_{\ell=1}^{d} q_\ell(x_\ell)$ and, for $\ell = 1, \dots, d$,
$q_{-\ell}(x_{-\ell}) = \prod_{j \ne \ell} q_j(x_j)$, consider, for $\ell = 1, \dots, d$, the quantities

$$ \eta_\ell(x_\ell) = \int_{\mathbb{R}^{d-1}} m_\psi(x) q_{-\ell}(x_{-\ell})\,dx_{-\ell} - \int_{\mathbb{R}^d} m_\psi(x) q(x)\,dx. \tag{2.5} $$

It is straightforward that the two following equalities hold,

$$ \eta_\ell(x_\ell) = m_\ell(x_\ell) - \int_{\mathbb{R}} m_\ell(z) q_\ell(z)\,dz, \quad \ell = 1, \dots, d, \tag{2.6} $$

$$ m_\psi(x) = \sum_{\ell=1}^{d} \eta_\ell(x_\ell) + \int_{\mathbb{R}^d} m_\psi(z) q(z)\,dz. \tag{2.7} $$

In view of (2.6) and (2.7), $\eta_\ell$ and $m_\ell$ are equal up to an additive constant,
in such a way that $\eta_\ell$ is actually an additive component too, which fulfils an
alternative identifiability condition.

Remark 2.1 Observe that $\eta_\ell = m_\ell$ for the particular choice $q_\ell = f_\ell$, $f_\ell$
denoting the density function pertaining to $X_\ell$, $\ell = 1, \dots, d$. However, $f_\ell$ being
unknown in most situations, we most often have $\eta_\ell \ne m_\ell$.
From (2.4) and (2.5), a natural estimator of the $\ell$-th additive component $\eta_\ell$
evaluated at $x_\ell$ is given, for $\ell = 1, \dots, d$, by

$$ \hat\eta^*_\ell(x_\ell) = \int_{\mathbb{R}^{d-1}} m^*_{\psi,n}(x) q_{-\ell}(x_{-\ell})\,dx_{-\ell} - \int_{\mathbb{R}^d} m^*_{\psi,n}(x) q(x)\,dx, \tag{2.8} $$

from which an estimator $m^*_{\psi,\mathrm{add}}$ of the additive regression function can easily
be deduced (see (2.7)),

$$ m^*_{\psi,\mathrm{add}}(x) = \sum_{\ell=1}^{d} \hat\eta^*_\ell(x_\ell) + \int_{\mathbb{R}^d} m^*_{\psi,n}(x) q(x)\,dx. \tag{2.9} $$
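The marginal integration step in (2.8) can be approximated numerically by replacing the integrals with weighted sums over a tensor grid. The sketch below is a minimal illustration under that assumption; the helper names and the use of a simple Riemann-type quadrature are choices made here, not prescriptions of the paper.

```python
import numpy as np

def _weighted_sum(m_hat, grids, weights):
    """Sum of m_hat over the tensor grid, weighted by the product of weights."""
    mesh = np.meshgrid(*grids, indexing="ij")
    pts = np.stack([g.ravel() for g in mesh], axis=1)
    w = np.ones(pts.shape[0])
    for wj in np.meshgrid(*weights, indexing="ij"):
        w = w * wj.ravel()
    return float(np.sum(m_hat(pts) * w))

def marginal_integration(m_hat, ell, x_ell, grids, q_weights):
    """Approximate the ell-th integrated component at x_ell.

    m_hat: callable mapping an array of shape (N, d) to N regression values;
    grids[j]: 1-D quadrature nodes for coordinate j;
    q_weights[j]: q_j(node) * node spacing, so that each array sums to about 1.
    """
    # Second term: integral of m_hat against the full product density q.
    full = _weighted_sum(m_hat, grids, q_weights)
    # First term: freeze coordinate ell at x_ell and integrate out the others.
    g = list(grids)
    w = list(q_weights)
    g[ell] = np.array([x_ell])
    w[ell] = np.array([1.0])
    partial = _weighted_sum(m_hat, g, w)
    return partial - full
```

For an additive `m_hat`, the contributions of the other components cancel between the two terms, which is exactly the mechanism behind (2.6).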



3 Hypotheses and Results 



These preliminaries being given, we introduce the assumptions to be made to
state our result. First, consider the hypotheses pertaining to $(Y, C, X)$.

(C.1) $C$ and $(X, Y)$ are independent.
(C.2) $G$ is continuous on $\mathbb{R}^+$.
(C.3) There exists a finite constant $M$ such that $\sup_{y} |\psi(y)| \le M$.
(C.4) $m_\psi$ is $k$-times continuously differentiable, $k \ge 1$, and

$$ \sup_{x} \left| \frac{\partial^k}{\partial x_\ell^k} m_\psi(x) \right| < \infty; \quad \ell = 1, \dots, d. $$

As mentioned in Gross and Lai (1996), functionals of the (conditional) law
can generally not be estimated on the complete support when the variable
of interest is right-censored. Accordingly, we will work under the assumption
(A), which will be said to hold if either (A)(i) or (A)(ii) below holds. Denote
by $T_L = \sup\{t : L(t) > 0\}$ the upper endpoint of the distribution of a random
variable with right-continuous survival function $L$.

(A)(i) There exists a $\tau_0 < T_H$ such that $\psi = 0$ on $(\tau_0, \infty)$.

(A)(ii) (a) For a given $k/(2k+1) < p \le 1/2$, $\left| \int_0^{T_H} F^{-p/(1-p)}\,dG \right| < \infty$;
(b) $T_F < T_G$;
(c) $n^{2p-1} h_{\ell,n}^{-1} |\log(h_{\ell,n})| \to \infty$, as $n \to \infty$, for every $\ell = 1, \dots, d$.

Remark 3.1 (i) It is noteworthy that assumption (A)(ii) allows for considering
the estimation of the "classical" regression function, which corresponds
to the choice $\psi(y) = y$. On the other hand, normality for estimators of
functionals such as the conditional distribution function $\mathbb{P}(Y \le \tau_0 \mid X)$ can be
obtained under weaker conditions, when restricting ourselves to $\tau_0 < T_H$.

(ii) When working under (A)(ii), assumption (C.3) can be weakened. In this
setting, it is indeed sufficient to work under (C.3)' below.

(C.3)' There exists a finite constant $M$ such that $\sup_{y \le \tau_0} |\psi(y)| \le M$.

(iii) It is also noteworthy that condition (C.1) is stronger than the conditional
independence of $C$ and $Y$ given $X$, under which Beran (1981) worked to build
an estimator of the conditional survival function (see also Dabrowska (1995)
and Deheuvels and Derzko (2007)). Note, however, that the two assumptions
coincide if $C$ and $X$ are independent. Moreover, to use Beran's local
Kaplan-Meier estimator, the censoring has to be locally fair, that is,
$\mathbb{P}[C > t \mid X = x] > 0$ whenever $\mathbb{P}[Y > t \mid X = x] > 0$. Here, we basically only
suppose that $G(t) > 0$ whenever $F(t) > 0$, which is, in turn, a weaker
assumption. For a nice discussion of the difference between Beran's estimator
and Carbonez et al.'s estimator, we refer to Carbonez et al. (1995).



Denote by $\mathcal{C}_1, \dots, \mathcal{C}_d$, $d$ compact intervals of $\mathbb{R}$ and set $\mathcal{C} = \mathcal{C}_1 \times \dots \times \mathcal{C}_d$. For
every subset $\mathcal{E}$ of $\mathbb{R}^d$, and any $\alpha > 0$, introduce the $\alpha$-neighborhood $\mathcal{E}^\alpha$ of $\mathcal{E}$,
namely $\mathcal{E}^\alpha = \{x : \inf_{y \in \mathcal{E}} |x - y|_{\mathbb{R}^d} \le \alpha\}$, $|\cdot|_{\mathbb{R}^d}$ standing for the Euclidean norm
on $\mathbb{R}^d$.

We will work under the following regularity assumptions on $f$ and $f_\ell$,
$\ell = 1, \dots, d$ ($f_\ell$ denoting the density function of $X_\ell$, as in Remark 2.1). These
functions are supposed to be continuous. Moreover, we assume the existence
of a constant $\alpha > 0$ such that the following assumptions hold.

(F.1) $\forall\, x_\ell \in \mathcal{C}_\ell^\alpha$, $f_\ell(x_\ell) > 0$, $\ell = 1, \dots, d$, and $\forall\, x \in \mathcal{C}^\alpha$, $f(x) > 0$.
(F.2) $f$ is $k'$-times continuously differentiable on $\mathcal{C}^\alpha$, with $k' > kd$.

Regarding the kernels $K$ and $K_\ell$, $\ell = 1, \dots, d$, defined in $\mathbb{R}^d$ and $\mathbb{R}$
respectively, they are assumed to be bounded, integrable to 1, with compact
support and such that

(K.1) $K_\ell$ is of order $k$, $\ell = 1, \dots, d$.
(K.2) $K$ is of order $k'$.

In addition, we impose the following assumptions on the given integrating
density functions $q_{-\ell}$ and $q_\ell$, $\ell = 1, \dots, d$.

(Q.1) $q_{-\ell}$ is bounded and continuous, $\ell = 1, \dots, d$.
(Q.2) For $\ell = 1, \dots, d$, $q_\ell$ has a compact support included in $\mathcal{C}_\ell$ and has
$(k + 1)$ continuous and bounded derivatives.

Finally, turning our attention to the sequences $(h_n)_{n \ge 1}$ and $(h_{\ell,n})_{n \ge 1}$,
$\ell = 1, \dots, d$, we will work under the conditions below.

(H.1) $h_n = c' (\log n / n)^{1/(2k'+d)}$, for a fixed $0 < c' < \infty$.
(H.2) $h_{\ell,n} = c\, n^{-1/(2k+1)}$, $\ell = 1, \dots, d$, for a fixed $0 < c < \infty$.

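Some more notation is needed for the statement of our results. Set, for all
$x \in \mathcal{C}$ and every $\ell = 1, \dots, d$,

$$ b_\ell(x_\ell) = \frac{c^k}{k!} \int_{\mathbb{R}} u^k K_\ell(u)\,du \left( (-1)^k m_\ell^{(k)}(x_\ell) - \int_{\mathbb{R}} m_\ell(z) q_\ell^{(k)}(z)\,dz \right), \tag{3.1} $$

and

$$ \sigma_\ell^2(x_\ell) = \frac{\int_{\mathbb{R}} K_\ell^2(u)\,du}{c\, f_\ell(x_\ell)} \int_{\mathbb{R}^{d-1}} H(x) \frac{q_{-\ell}^2(x_{-\ell})}{f(x_{-\ell} \mid x_\ell)}\,dx_{-\ell}, \tag{3.2} $$

where

$$ H(x) = \mathbb{E}\left[ \frac{\psi^2(Y)}{G(Y)} \,\Big|\, X = x \right]. \tag{3.3} $$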



We now have all the ingredients to state our main result in Theorem 3.1 below.

Theorem 3.1 Under the conditions (C.1-2-3-4), (F.1-2), (K.1-2), (Q.1-2)
and (H.1-2), we have, for every $\ell = 1, \dots, d$ and all $x_\ell \in \mathcal{C}_\ell$,

$$ \frac{n^{k/(2k+1)} \{\hat\eta^*_\ell(x_\ell) - \eta_\ell(x_\ell)\} - b_\ell(x_\ell)}{\sigma_\ell(x_\ell)} \xrightarrow{\;\mathcal{L}\;} \mathcal{N}(0, 1). \tag{3.4} $$

This result naturally implies the following corollary, which corresponds to a
refinement of Theorem 3.1 in Debbarh and Viallon (2007a).

Corollary 3.1 Under the conditions of Theorem 3.1, we have, for every
$\ell = 1, \dots, d$ and all $x_\ell \in \mathcal{C}_\ell$,

$$ n^{2k/(2k+1)}\, \mathbb{E}\left( \hat\eta^*_\ell(x_\ell) - \eta_\ell(x_\ell) \right)^2 = b_\ell^2(x_\ell) + \sigma_\ell^2(x_\ell) + o(1). $$



The proof of Theorem 3.1 is postponed to Section 5. A rough outline of our
arguments is as follows. First, we consider the case where both the density
function $f$ of $X$ and the function $G$ are known (see Lemma 5.1 below). In
this setting, we show that the estimator defined in (2.4) can be written as
an uncensored estimator of the regression function, and the result of Lemma
5.1 follows from arguments similar to those developed in Camlong-Viot et al.
(2000) in the uncensored setting. Then, from the result of Ango-Nze and Rios
(2000), we can extend Lemma 5.1 to the case where only $G$ is known. Finally,
the uniform consistency of $G_n^*$ (see, for instance, Földes and Rejtő (1981))
enables us to conclude the proof of Theorem 3.1 in the general case.



4 Application: construction of confidence intervals

4.1 Construction of confidence intervals

Under the assumptions of Theorem 3.1, it is straightforward that, for
$\ell = 1, \dots, d$, the interval

$$ \hat\eta^*_\ell(x_\ell) - n^{-k/(2k+1)} b_\ell(x_\ell) \pm 1.96\, n^{-k/(2k+1)} \sigma_\ell(x_\ell) $$

is a confidence interval for $\eta_\ell(x_\ell)$, at an asymptotic 95% level, for all $x_\ell \in \mathcal{C}_\ell$.
However, as can be seen in the definition (3.1), the bias term $b_\ell$ is a "complex"
function involving $m_\ell$ and $m_\ell^{(k)}$. Bias estimates could be built by using
estimates of these quantities, but this would result in quite complex algorithms
to derive the confidence bands. Following the ideas which have been proposed to
construct confidence bands for kernel-type estimators of the regression
function (see, e.g., Section 4.2 in Härdle (1990)), our aim is now to make the
bias term vanish. A close look at the proof of Theorem 3.1 reveals that, if the
bandwidth $h_{\ell,n}$ is chosen proportional to $n^{-1/(2k+1)}$ times a sequence that tends
slowly to 0, then the bias vanishes asymptotically. In this case, for
$\ell = 1, \dots, d$, the interval

$$ \hat\eta^*_\ell(x_\ell) \pm 1.96\, \sigma_\ell(x_\ell)\, n^{-k/(2k+1)} $$

provides a confidence interval for $\eta_\ell(x_\ell)$, at an asymptotic 95% level, for all
$x_\ell \in \mathcal{C}_\ell$. Then, given any consistent estimator $\hat\sigma_\ell$ of $\sigma_\ell$ (making use, for
instance, of kernel estimators for the density functions involved in $\sigma_\ell$), we
conclude by Slutsky's Theorem that

$$ \hat\eta^*_\ell(x_\ell) \pm 1.96\, \hat\sigma_\ell(x_\ell)\, n^{-k/(2k+1)} \tag{4.1} $$

provides a confidence interval for $\eta_\ell(x_\ell)$, at an asymptotic 95% level, for all
$x_\ell \in \mathcal{C}_\ell$.
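Once a point estimate and a consistent estimate of the asymptotic standard deviation are available, the interval (4.1) is a one-line computation. The Python sketch below only illustrates the formula; the function name `pointwise_ci` and its argument names are hypothetical, and the estimates themselves are assumed to be supplied by the user.

```python
def pointwise_ci(eta_hat, sigma_hat, n, k, z=1.96):
    """Interval eta_hat +/- z * sigma_hat * n**(-k / (2k + 1)), as in (4.1).

    eta_hat: estimate of the additive component at the point of interest;
    sigma_hat: consistent estimate of the asymptotic standard deviation;
    n: sample size; k: order of the univariate kernels;
    z: normal quantile (1.96 for an asymptotic 95% level).
    """
    half_width = z * sigma_hat * n ** (-k / (2 * k + 1))
    return eta_hat - half_width, eta_hat + half_width
```

For instance, with second-order kernels ($k = 2$) and $n = 1000$, the half-width is $1.96\, \hat\sigma_\ell(x_\ell)\, 1000^{-2/5}$, that is, roughly $0.124\, \hat\sigma_\ell(x_\ell)$.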



4.2 Illustration: a simple simulation study

In this paragraph, we present some results from a simulation study,
which in particular makes it possible to compare the confidence bands just given
with the almost certainty bands we proposed in Debbarh and Viallon (2007a).
We worked with a sample size $n = 1000$, and considered the case where
$X = (X_1, X_2) \in \mathbb{R}^2$ (i.e. $d = 2$) was such that $X_1 \sim U(-1, 1)$ and $X_2 \sim U(-1, 1)$,
where $U(a, b)$ stands for the uniform law on $(a, b)$. Set $m_1(x) = 0.5 \times \cos^2(x)$
and $m_2(x) = 0.5 \times \sin^2(x)$. We selected $\psi = \mathbb{1}_{\{\cdot \le 0.9\}}$, and considered the
model $\mathbb{E}[\psi(Y) \mid X_1 = x_1, X_2 = x_2] = m_1(x_1) + m_2(x_2)$. Under this model,
the variable $Y$ was simulated as follows. For each integer $1 \le i \le n$, let
$p_i = m_1(x_{1,i}) + m_2(x_{2,i})$, where $x_{j,i}$ is the $i$-th observed value of the variable
$X_j$, $j = 1, 2$. Note that $0 \le p_i \le 1$ for every $1 \le i \le n$. Each $Y_i$ was then
generated as one $U(0.9 - p_i, 1 + 0.9 - p_i)$ variable. Following this procedure ensured
that $\mathbb{P}(Y_i \le 0.9 \mid X_i = x_i) = p_i = m_1(x_{1,i}) + m_2(x_{2,i})$. Regarding the censoring
variable, we generated an i.i.d. sample $C_1, \dots, C_n$ such that $C_i \sim U(0, 1)$. This
choice yielded, a posteriori, $\mathbb{P}(\delta = 1) \simeq 0.2$. We used Epanechnikov kernels
(for $K$, $K_1$ and $K_2$) and selected $q_1 = q_2 = 0.5 \times \mathbb{1}_{[-1,1]}$ (in such a way that
the additive components to estimate were $\eta_j = m_j - 0.25$, $j = 1, 2$). As for
the bandwidth choice, we opted a priori for $h_{1000} = h_{1,1000} = h_{2,1000} = 0.2$.
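The data-generating mechanism described above can be sketched as follows. This is an illustrative reconstruction, assuming the component functions exactly as printed in the text; the function name `simulate_sample` is hypothetical and the authors' actual simulation code is not available.

```python
import numpy as np

def simulate_sample(n, rng):
    """Draw one censored sample following the simulation design of the text.

    Returns the covariates x, the observed times z, the indicators delta,
    and, for checking purposes only, the latent y and the probabilities p.
    """
    x = rng.uniform(-1.0, 1.0, size=(n, 2))
    # p_i = m_1(x_{1,i}) + m_2(x_{2,i}), with m_1, m_2 as printed in the text.
    p = 0.5 * np.cos(x[:, 0]) ** 2 + 0.5 * np.sin(x[:, 1]) ** 2
    # Y_i ~ U(0.9 - p_i, 1.9 - p_i), so that P(Y_i <= 0.9 | X_i) = p_i.
    y = rng.uniform(0.9 - p, 1.9 - p)
    # Censoring times, independent of (X, Y).
    c = rng.uniform(0.0, 1.0, size=n)
    z = np.minimum(y, c)
    delta = (y <= c).astype(int)
    return x, z, delta, y, p

x, z, delta, y, p = simulate_sample(1000, np.random.default_rng(0))
uncensored_rate = delta.mean()  # empirical proportion of uncensored data
```

Only `x`, `z` and `delta` would be available to the statistician; `y` and `p` are returned here solely to check the design, for example that the proportion of uncensored observations is of the order reported in the text.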

Graphical representations of the results are provided in Figure 1. It can be
seen that the confidence intervals derived from the asymptotic normality are
less conservative than the ones obtained from the uniform law of the
logarithm. The price to pay, however, is that the true function does not belong
to the former intervals at every $x \in [0, 1]$. Therefore, in most applications, we
recommend the construction of both confidence bands to assess the form of
the relationship between the dependent variable and covariates.




(a) First component (b) Second component 

Figure 1. Results of the simulation study for (a) the first additive component and 
(b) the second additive component : true additive components (blue solid line), their 
estimates (black dashed line), the 95% confidence intervals (black dotted lines) and 
the almost certainty bands (red dotted lines). 
5 Proof of Theorem 3.1

Here we present the detailed proof of Theorem 3.1. Only the proof for $\ell = 1$
is presented, the proof for the $d - 1$ remaining components being similar and
therefore omitted.



5.1 The case where both f and G are known 



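Recall the definitions (2.4) and (2.8) of $m^*_{\psi,n}$ and $\hat\eta^*_\ell$ respectively. Further
denote by $\tilde{\tilde m}_{\psi,n}(x)$ and $\tilde{\tilde\eta}_\ell(x_\ell)$ the versions of $m^*_{\psi,n}$ and $\hat\eta^*_\ell$ respectively, in the
case where both $f$ and $G$ are known. Namely, we have

$$ \tilde{\tilde m}_{\psi,n}(x) = \sum_{i=1}^{n} \tilde W_{n,i}(x) \frac{\delta_i \psi(Z_i)}{G(Z_i)}, \quad \text{with} \quad \tilde W_{n,i}(x) = \frac{\prod_{\ell=1}^{d} \frac{1}{h_{\ell,n}} K_\ell\!\left(\frac{x_\ell - X_{i,\ell}}{h_{\ell,n}}\right)}{n f(X_i)}, \tag{5.1} $$

and

$$ \tilde{\tilde\eta}_\ell(x_\ell) = \int_{\mathbb{R}^{d-1}} \tilde{\tilde m}_{\psi,n}(x) q_{-\ell}(x_{-\ell})\,dx_{-\ell} - \int_{\mathbb{R}^d} \tilde{\tilde m}_{\psi,n}(x) q(x)\,dx. \tag{5.2} $$

Consider the function $\Psi : \mathbb{R}^2 \to \mathbb{R}$ such that

$$ \Psi(y, c) = \frac{\mathbb{1}_{\{y \le c\}}\, \psi(y \wedge c)}{G(y \wedge c)}, \quad \text{for all } (y, c) \in \mathbb{R}^2. \tag{5.3} $$

In view of (5.1) and (5.3), we have

$$ \tilde{\tilde m}_{\psi,n}(x) = \sum_{i=1}^{n} \tilde W_{n,i}(x) \Psi(Y_i, C_i). \tag{5.4} $$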

In the sequel we will make frequent use of a conditional argument, along with
the independence assumption (C.1), that in particular enables us to obtain the
following kind of result.

$$ m_\Psi(X) := \mathbb{E}(\Psi(Y, C) \mid X) = \mathbb{E}\left\{ \frac{\psi(Y)}{G(Y)}\, \mathbb{E}\left[ \mathbb{1}_{\{Y \le C\}} \mid X, Y \right] \,\Big|\, X \right\} = m_\psi(X). \tag{5.5} $$

Combining this last result with the fact that the quantity $\Psi(Y_i, C_i)$ is observed
(i.e., uncensored) for all $i = 1, \dots, n$, we have the following particularly appealing
property: $\tilde{\tilde m}_{\psi,n}(x)$ turns out to be an uncensored estimator of the regression
function $m_\psi$. This property makes it possible to treat the particular case where $G$ is
known with arguments similar to those used in the uncensored case.
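The unbiasedness property behind (5.5) can be checked numerically. The sketch below is a simplified Monte Carlo illustration without covariates, under hypothetical distributions chosen for the example (uniform lifetimes and exponential censoring, so that $G(t) = e^{-t}$ is known in closed form); it is not part of the original proof.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000
y = rng.uniform(0.0, 1.0, size=n)        # hypothetical lifetimes, E[Y] = 0.5
c = rng.exponential(scale=1.0, size=n)   # censoring times, G(t) = exp(-t)
z = np.minimum(y, c)                     # observed times
delta = (y <= c).astype(float)           # 1 if uncensored

def psi(t):
    """psi(y) = y, bounded on the support [0, 1] of Y."""
    return t

# Synthetic data Psi(Y_i, C_i): computable from (Z_i, delta_i) and the known G.
synthetic = delta * psi(z) / np.exp(-z)

naive_mean = psi(z[delta == 1]).mean()   # ignores censoring, biased downwards
ipcw_mean = synthetic.mean()             # close to E[psi(Y)] = 0.5
```

The naive average over uncensored observations underestimates $\mathbb{E}\psi(Y)$ because long lifetimes are censored more often, whereas the weighted average corrects for this by dividing by $G(Z_i)$.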



We will first establish the following result, which corresponds to Theorem 3.1
in the case where $f$ and $G$ are known.

Lemma 5.1 Assume (C.1-2-3-4), (K.1), (Q.1-2) and (H.2) hold. Then, for
every $\ell = 1, \dots, d$ and all $x_\ell \in \mathcal{C}_\ell$,

$$ \frac{n^{k/(2k+1)} \{\tilde{\tilde\eta}_\ell(x_\ell) - \eta_\ell(x_\ell)\} - b_\ell(x_\ell)}{\sigma_\ell(x_\ell)} \xrightarrow{\;\mathcal{L}\;} \mathcal{N}(0, 1). \tag{5.6} $$

Proof. In view of the discussion above, we will mostly borrow the arguments
developed in the uncensored case by Camlong-Viot et al. (2000). Towards this
aim, introduce the following quantities.



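$$ \Psi_n(Y_i, C_i) = \Psi(Y_i, C_i) \frac{\mathcal{G}(X_{i,-1})}{f(X_{i,-1} \mid X_{i,1})}, \tag{5.7} $$

$$ \mathcal{G}(u_{-1}) = \int_{\mathbb{R}^{d-1}} \prod_{\ell=2}^{d} \frac{1}{h_{\ell,n}} K_\ell\!\left(\frac{x_\ell - u_\ell}{h_{\ell,n}}\right) q_{-1}(x_{-1})\,dx_{-1}, \tag{5.8} $$

$$ \alpha_1(x_1) = \frac{1}{n h_{1,n}} \sum_{i=1}^{n} \frac{\Psi_n(Y_i, C_i)}{f_1(X_{i,1})} K_1\!\left(\frac{x_1 - X_{i,1}}{h_{1,n}}\right), \tag{5.9} $$

$$ \tilde m(x_1) = \mathbb{E}\left( \Psi_n(Y_1, C_1) \mid X_{1,1} = x_1 \right), \tag{5.10} $$

$$ C_n = \mu + \int_{\mathbb{R}^{d-1}} \sum_{j=2}^{d} m_j(u_j)\, \mathcal{G}(u_{-1})\,du_{-1}, \tag{5.11} $$

$$ \hat C_n = \int_{\mathbb{R}^d} \tilde{\tilde m}_{\psi,n}(x) q(x)\,dx, \tag{5.12} $$

$$ C = \int_{\mathbb{R}} m_1(x_1) q_1(x_1)\,dx_1. \tag{5.13} $$

Next observe that

$$ \tilde{\tilde\eta}_1(x_1) - \eta_1(x_1) = \{\alpha_1(x_1) - \tilde m(x_1)\} - (\hat C_n - C_n - C), \tag{5.14} $$

and set

$$ \beta_1 = \frac{(-1)^k c^k}{k!}\, m_1^{(k)}(x_1) \int_{\mathbb{R}} v_1^k K_1(v_1)\,dv_1, \tag{5.15} $$

$$ \beta_2 = \frac{c^k}{k!} \int_{\mathbb{R}} \left( \int_{\mathbb{R}} v_1^k\, m_1^{(k)}(x_1 + v_1 h_{1,n}) K_1(v_1)\,dv_1 \right) q_1(x_1)\,dx_1. \tag{5.16} $$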
(5.16) 
From (5.14), (5.15) and (5.16), and because of Slutsky's theorem, the proof of
Lemma 5.1 will be completed as soon as the four following results are
established.

$$ n^{k/(2k+1)} \{\alpha_1(x_1) - \tilde m(x_1)\} - \beta_1 \xrightarrow{\;\mathcal{L}\;} \mathcal{N}(0, \sigma_1^2(x_1)), \tag{5.17} $$

$$ n^{k/(2k+1)}\, \mathbb{E}(\hat C_n - C_n - C) = \beta_2 + o(1), \tag{5.18} $$

$$ n^{2k/(2k+1)}\, \mathrm{Var}(\hat C_n) = o(1), \tag{5.19} $$

$$ n^{2k/(2k+1)}\, \mathrm{Cov}(\hat C_n, \alpha_1(x_1)) = o(1). \tag{5.20} $$

Proof of (5.17): In a first step, our aim is to show that

$$ \sqrt{n h_{1,n}}\, \{\alpha_1(x_1) - \mathbb{E}(\alpha_1(x_1))\} \xrightarrow{\;\mathcal{L}\;} \mathcal{N}(0, \sigma_1^2(x_1)). \tag{5.21} $$

We claim that

$$ n h_{1,n}\, \mathrm{Var}(\alpha_1(x_1)) \to \sigma_1^2(x_1) \quad \text{as } n \to \infty, \tag{5.22} $$

where $\sigma_1^2(x_1)$ is as in (3.2). Recalling (5.9), note that



v (- ( w 1 *n(Yi,Ci) 

Var(ai(Xl)H ^gnA(^v 
i ™ 



with 



3l> 



1 

n 



i=i 
1 



C t ) r ^ fX 1 -X iil 



hi,, 



fl{Xi,\)hi n \ h\ >n 



^l,n V hi tn J ff(Xi t i) 



Using classical conditioning arguments and recalling the definition (5.10) of
$\tilde m$, it is straightforward that



ri, n (zi) 

$l,n(^l) 



/m 



hi,, 



-Kl 



hl, n 

'x 1 -u 1 \ E(^ 2 n (Xi,Ci) \X iA = Ul ) 



hl, n 



h{ui) 



dui. 



12 



Moreover, the definitions (5.7) and (5.10), when combined with the
argument used in (5.5), yield



\fh{u 1 )\ = \E{* n (Y i ,C i ) \X i>1 =u 1 ) 

\G{Zi) /(X<,_i| 
/ m^,(u)^(u_ 1 )ciu_i 

7lRd-l 

in such a way that 



Xii = ui 



< oo, 



— r? n (xi) — > as n — > oo. 

n 



(5.23) 



Moreover, using once again the argument used to derive (5.5) and keeping in
mind the definition (3.3) of $H$, it is easy to derive that, under (C.1),



E(* 2 n (Y i ,C i )\X i>1 = x 1 ) 



\G(Zi)J V/(X<,_i|X, 



v g(^) y 



jRd-i / 2 (u_i|Xi) 

S 2 (u-i 



Xi ; i — x\ 
Xii = x\ 



H(u) 



/(u-i | an) 



(5.24) 



Next, making use of the classical change of variable V(h^ n — Ug — along 
with a Taylor expansion of order k (which is rendered possible by (Q.2)), we 
readily have by (K.l), for a given < 6 < 1, 



x ft I — [ Xt u Ui ) qi{xi)dx„i - g_i(u_i) 



, -2 hl.,n 



V t^ l Ln (k) 



{6Vfh^ n + U() 



dv-i 



o(l), 



Combining (5.24) and (5.25), we get



H(x) q ~^ X - 1 ' 



(5.25) 



i 1 f{x-i | Xi) 



daj_i + o(l). (5.26) 
In addition, setting

$$ \Phi(x_1) = \int_{\mathbb{R}^{d-1}} H(x) \frac{q_{-1}^2(x_{-1})}{f(x_{-1} \mid x_1)}\,dx_{-1} $$

and using once again the change of variable $v_1 h_{1,n} = x_1 - u_1$, we obtain



/i(xi - Mi) 



E(^(Yi,Ci)| Ai = xi-Mi)^i 



'E(^(y. ;C -.) |X iil = x!-Mi) 



/i(a?i - hivi) 



I (if 1 



fi(xi) 



Kl{v\)dv\. 



But, by (C.3), (-KM), (F.l) and (Q.l-2), it is easily shown that the quantity 
|E(f {Yi,Ci)\ X iA = u^/hiux) - $(xx)/f x (x x )\ is bounded. Therefore, (OS!) 
when combined with Lebesgue's dominated convergence Theorem enables us 
to conclude that 



fi(xi) 



Kl{v\)dv\. 



From (5.23) and (5.27), the claim (5.22) is proved.



Now, we set 



(5.27) 



Ti 



1, 11 



1 V n (Y t ,C t )^ (x 1 -X itl 



and 



Var(T,, 



nVax(ai(xx)). 



Ti 



■ i,n 



ET 5 ; 



(5.28) 



For all £ > 0, we have 



rp2 



< 



Mi 



-p 



T 



ns 2 ^v^Sn'- £ U hl^ns^ V v^^n 
Mi E(7; 2 J 



> £ 



< 



< 



MlE(T: 



2 ) 



[a\{x x ) + o(l))% 2 n 4 



< 



M, 



(5.29) 



where Mi and M 2 are two finite and positive constants. Combining (I5.29P with 
the fact that Tj^/s 2 — > (which follows fr om (I5.22P and (15.281) ). we can apply 
the normal convergence criterion (see, e.g. iLoevd (119631 ) . p. 295) to obtain 
$$ \frac{1}{\sqrt{n}\, s_n} \sum_{i=1}^{n} \left( T_{i,n} - \mathbb{E}\, T_{i,n} \right) \xrightarrow{\;\mathcal{L}\;} \mathcal{N}(0, 1). \tag{5.30} $$

Finally, readily comes from (iOl . (15^21) . (E2HD and (1CTD . 



Now, our aim is to evaluate the term |m(xi) — Ed!(xi)|. First, from (15. 7p . 
(JSIBD and flBTTUD . note that 



m(x l )=E[ty n (Y i} C i ) X ijl = x 1 
*{Y t ,Ci] 



E 



/(X.j,_i|X, 



^(Xj _ x ) X i;1 = xi 



Then, using a conditioning argument along with the independence assumption 
(C.l), we get 



'E(^Q)PQ) 
m(xi) = E<{ — — — - g(Xi i) X il = x 1 



/(Xi -ilX^i) 

= / mJxi, u_i)^(u_i) du_i. 
Thus, by (K.l) and (C.4), a Taylor expansion yields 

E(ai(xi)) — m(xi) 

-fh(ui)Ki ( — 7 -^\dui — m(xi) 



m(a;i — V\h\^) — fn{x\) K\{v\)dv\ 

m^(xi - vih hn , u_i) - m^(xi, u_i) ^(u_ 1 )du_iiTi(ui)^i 



A;! 



G(xi-i)dxi-iKi{vi)dvi 



By combining this last result with (5.21), we conclude to (5.17). □



Proof of (5.18). Keep in mind the definitions (5.11) and (5.12) of $C_n$ and $\hat C_n$.
Then, according to Fubini's Theorem and under the additive model assumption,
E(C n - C n ) = E j m^ >n (x)q(x)dx - fx - J ^ ^ mj(uj)G{a~i)du-i 

JR - =2 

~t ^ ^l,n ^ ^ ^l,n / 

„ d 

~ / y]m i (u i )0(u_.i)du_i 



3=2 

-m\(ui)Ki [ \qi{xi)dxidu\. 



But, by (C.4) and (K.l) and using a Taylor expansion, we get, 

E(4 -C n )-C 

qi(xi)mi(xx + hi, n vi)Ki{vi)dv\dxi — C 

qi(xi)[m 1 (xi + h ln Vi) — mi(xi)}K 1 (vi)dvidxi 



hl,n v l (k) ( . , x 



\)dv \dx\ + o(/i ln ) 



fc! 

= /4& + o(/4), (5.31) 
which allows us to conclude to (5.18). □

Proof of nrm) . In view of the definitions (jOi . flO) . CT) and fl5TT2l . using 
the boundedness of (which is ensured by (Q.l-2)) and \I> (which is ensured 
by (C.3)) along with the fact that, by (-F.1), / is bounded away from 0, we 
have, for a given M3 > 0, 

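$$ \mathrm{Var}(\hat C_n) \le \frac{M_3}{n}\, \mathbb{E}\left( \int_{\mathbb{R}} \frac{1}{h_{1,n}} K_1\!\left(\frac{x_1 - X_{i,1}}{h_{1,n}}\right) q_1(x_1)\,dx_1 \right)^2 = O\!\left(\frac{1}{n}\right), \tag{5.32} $$

which naturally implies (5.19). □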

Proof of (5.20). Using the Cauchy-Schwarz inequality, we infer from (5.22)
and (5.32) that

$$ |\mathrm{Cov}(\hat C_n, \alpha_1(x_1))| \le (\mathrm{Var}\, \hat C_n)^{1/2} (\mathrm{Var}\, \alpha_1(x_1))^{1/2} = O\!\left( n^{-1} h_{1,n}^{-1/2} \right), $$

which implies (5.20). □

As already pointed out, the proof of Lemma 5.1 is readily completed by
combining (5.17), (5.18), (5.19) and (5.20). □



5.2 The case where f is unknown but G is known

The key idea in this case is to use the uniform consistency of $f_n$ (see, e.g.,
Ango-Nze and Rios (2000)) along with the following decomposition,

$$ \frac{1}{f_n} = \frac{1}{f} - \frac{f_n - f}{f_n f}. \tag{5.33} $$



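When the density $f$ is unknown and $G$ is known, the additive component
estimates are defined as follows, for $\ell = 1, \dots, d$,

$$ \tilde\eta_\ell(x_\ell) = \int_{\mathbb{R}^{d-1}} \tilde m_{\psi,n}(x) q_{-\ell}(x_{-\ell})\,dx_{-\ell} - \int_{\mathbb{R}^d} \tilde m_{\psi,n}(x) q(x)\,dx, \tag{5.34} $$

where

$$ \tilde m_{\psi,n}(x) = \sum_{i=1}^{n} W_{n,i}(x) \frac{\delta_i \psi(Z_i)}{G(Z_i)}. \tag{5.35} $$

We will establish the following result.

Lemma 5.2 Under the hypotheses of Theorem 3.1, we have

$$ \sup_{x_\ell \in \mathcal{C}_\ell} |\tilde\eta_\ell(x_\ell) - \tilde{\tilde\eta}_\ell(x_\ell)| = O\!\left( \sqrt{\frac{\log n}{n h_n^d}} \right) \quad \text{a.s.} \tag{5.36} $$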



Proof. First note that the term $(n h_{\ell,n}^d)^{-1} \sum_{i=1}^{n} |K((x - X_i)/h_{\ell,n})|$ is almost
surely uniformly bounded on $\mathcal{C}$ under the assumptions we made on $K$ and
$f$. Moreover, $f$ and then $f_n$ (for $n$ large enough) are bounded away from 0
(see (F.1)). Then, in view of the definitions (2.4) and (5.1), along with the
decomposition (5.33), we get, by (F.2) and (H.1), that, for a given $C_1 > 0$,

$$ \sup_{x \in \mathcal{C}} \sum_{i=1}^{n} |\tilde W_{n,i}(x) - W_{n,i}(x)| \le C_1 \sup_{x \in \mathcal{C}^\alpha} |f_n(x) - f(x)| = O\!\left( \sqrt{\frac{\log n}{n h_n^d}} \right) \quad \text{a.s.}, \tag{5.37} $$

where we used the following result, due to Ango-Nze and Rios (2000),

$$ \sup_{x \in \mathcal{C}^\alpha} |f_n(x) - f(x)| = O\!\left( \sqrt{\frac{\log n}{n h_n^d}} \right) \quad \text{a.s., under (H.1), (F.2) and (K.2).} $$



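Next, under the assumptions (A), (C.2) and (C.3), we have $\max_i |\psi(Z_i)| / G(Z_i) < \infty$.
Thus, from (5.2), (5.34) and (5.35), we conclude that, for a given $C_2 > 0$,

$$ \sup_{x_\ell \in \mathcal{C}_\ell} |\tilde\eta_\ell(x_\ell) - \tilde{\tilde\eta}_\ell(x_\ell)| \le 2 \sup_{x \in \mathcal{C}} |\tilde m_{\psi,n}(x) - \tilde{\tilde m}_{\psi,n}(x)| \le C_2 \sup_{x \in \mathcal{C}} \sum_{i=1}^{n} |W_{n,i}(x) - \tilde W_{n,i}(x)|, $$

which, combined with (5.37), is Lemma 5.2. □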



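5.3 The case where both f and G are unknown

We have the following decomposition.

$$ \sup_{x_\ell \in \mathcal{C}_\ell} |\hat\eta^*_\ell(x_\ell) - \tilde{\tilde\eta}_\ell(x_\ell)| \le \sup_{x_\ell \in \mathcal{C}_\ell} |\hat\eta^*_\ell(x_\ell) - \tilde\eta_\ell(x_\ell)| + \sup_{x_\ell \in \mathcal{C}_\ell} |\tilde\eta_\ell(x_\ell) - \tilde{\tilde\eta}_\ell(x_\ell)|. $$

Lemma 5.3 Under the assumptions of Theorem 3.1, we have

$$ \sup_{x_\ell \in \mathcal{C}_\ell} |\hat\eta^*_\ell(x_\ell) - \tilde\eta_\ell(x_\ell)| = o\!\left( n^{-k/(2k+1)} \right) \quad \text{a.s.} \tag{5.38} $$

Proof. Observe that

$$ \sup_{x_\ell \in \mathcal{C}_\ell} |\hat\eta^*_\ell(x_\ell) - \tilde\eta_\ell(x_\ell)| \le 2 \sup_{x \in \mathcal{C}} |m^*_{\psi,n}(x) - \tilde m_{\psi,n}(x)|. \tag{5.39} $$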



First consider the case where (A)(i) holds. Under the assumptions of Theorem
3.1, we have

$$ |m^*_{\psi,n}(x) - \tilde m_{\psi,n}(x)| \le M \sum_{i=1}^{n} |W_{n,i}(x)| \sup_{y \le \tau_0} \left| \frac{1}{G(y)} - \frac{1}{G_n^*(y)} \right| $$

$$ \phantom{|m^*_{\psi,n}(x) - \tilde m_{\psi,n}(x)|} \le M \sum_{i=1}^{n} |W_{n,i}(x)| \sup_{y \le \tau_0} |G(y) - G_n^*(y)| \sup_{y \le \tau_0} \frac{1}{G_n^*(y) G(y)}, \tag{5.40} $$

where $M$ is as in (C.3). Since $\tau_0 < T_H$, the iterated law of the logarithm of
Földes and Rejtő (1981) ensures that

$$ \sup_{y \le \tau_0} |G(y) - G_n^*(y)| = O\!\left( \sqrt{\frac{\log\log n}{n}} \right) \quad \text{a.s.} \tag{5.41} $$

Besides, by the conditions imposed on $K$ and $f$, the term $\sum_{i=1}^{n} |W_{n,i}(x)|$ is
almost surely uniformly bounded. Combining this last result with (5.41), it
follows that

$$ \sup_{x \in \mathcal{C}} |m^*_{\psi,n}(x) - \tilde m_{\psi,n}(x)| = O\!\left( \sqrt{\frac{\log\log n}{n}} \right) \quad \text{a.s.} \tag{5.42} $$

From (5.39) and (5.42), we readily conclude to the result of Lemma 5.3 in the
case where (A)(i) holds.



In the case where (A)(ii) holds, the proof follows the same lines as above,
making use of either the iterated law of the logarithm of Gu and Lai (1990) (if
(A)(ii) holds with $p = 1/2$) or Theorem 2.1 of Chen and Lo (1997) (if (A)(ii)
holds with $k/(2k+1) < p < 1/2$) instead of the iterated law of the logarithm
of Földes and Rejtő (1981). The details are omitted. □

Finally, putting the results of Lemma 5.1, Lemma 5.2 and Lemma 5.3
together completes the proof of Theorem 3.1.